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Abstract 

A continuous-time Markov chain can be constructed as either a homogeneous or inhomoge- 
neous process. If homogeneity is assumed, the resulting chain is formulated by specifying 
time-independent rates of substitutions between states in the chain. In applications, there 
are usually extra constraints on the rates, depending on the situation. If a "model" is for- 
mulated in this way, it is possible to generalise it and allow for an inhomogeneous process, 
with time-dependent rates satisfying the same constraints. It is then useful to require that 
there exists a homogeneous "average" of this inhomogeneous process within the same model. 
This leads to the definition of "Lie Markov models" , which are precisely the class of models 
where such an average exists. These models form Lie algebras and hence concepts from 
Lie group theory are central to their derivation. In this paper, we concentrate on applica- 
tions to phylogenetics and nucleotide evolution, and derive the complete hierarchy of Lie 
Markov models that respect the grouping of nucleotides into purines and pyrimidines - ie. 
Lie Markov models with 62 I 62 symmetry. We also discuss how to handle the subtleties of 
applying Lie group methods, most naturally defined over the complex field, to the stochas- 
tic case of a Markov process, where parameter values are restricted to be real and positive. 
In particular, we explore the geometric embedding of the cone of stochastic rate matrices 
within the ambient space of the associated complex Lie algebra. 
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1 Introduction 



Most of the commonly implemented molecular evolutionary models in phylogenetics are based 
on the continuous-time Markov assumption. For these models, molecular substitution events 
(along an edge of a phylogenetic tree) are ruled by substitution rates. For DNA models - where 
the state space consists of the four nucleotides adenine, cytosine, guanine and thymine - twelve 
substitution rates must be specified for each edge of the tree, and the precise characteristics of the 
process are fixed by constraints on these rates. These constraints define a space of parameters 
that correspond to unknown evolutionary quantities such as base composition and mutation 
rate. As in all applied statistics, there is a trade-off between more complex, realistic models, and 
simpler, tractable models; more complex models can provide very close fits to the observed data, 
but are more vulnerable to random error. A standard assumption in molecular phylogenetics 
is to work with homogeneous Markov chains, where the substitution rates are assumed to be 
constant in time. 

The motivation behind our previous work ( Sumner et al. . 2012al ) was to consider the conse- 
quences of allowing for the possibility that some change in individual substitution rates may well 
have occurred independently across the evolutionary history. With this perspective, the evolu- 
tionary process can still be modeled as an continuous-time Markov chain, but we must allow the 
process to be inhomogeneous, where the rates are allowed to vary as a function of time throughout 
the evolutionary history. This leads to considering evolutionary model classes that are "multi- 
plicatively closed" . For such models, it it is possible to interpret their the time average of their 
inhomogeneous behaviour as a homogeneous proces s within the same model class. Many oft -used 
models, such as the general time-reversible model ( Tavarel 1986t IPosada fc Crandall . 1998 ). are 
not multiplicativcly closed and this deficiency poses a problem for phylogenetic a nalysis in both 



flexibi lity of interpretation and as a potential source of model-misspecification (jSumner et al. 
2012bh . For a multiplicatively closed model, it is possible to continue to model evolutionary 
processes as homogeneous, by interpreting the substitution rates that are fitted a s an "average" 



of the true inhomogeneous process that occurs on each branch of the tree. In iSumner et al. 



(|2012al ). we presented sufficient conditions for multiplicative closure of continuous-time Markov 
chains, which leads directly to the concept of Lie Markov model. These models arise when we 
request that, when extended over the complex field, the set of rate matrices of the model from 
a Lie algebra. This is a technical condition guaranteeing the corresponding set of substitution 
matrices will be multiplicatively closed, as desired. Mathematically, Lie Markov models can 
be re garded as a gene ralisation of other mod el classes, such as equi variant (jDraisma fc Kuttler . 
20081 ) or group-based (ISemple fc Steet [20031) models. 



In Sumner et al. ( 2012al) we discussed the symmetry properties of DNA models under nu- 
cleotide permutations, and noted the statistical relevance of these symmetries to likelihood cal- 
culations. The main result of that paper was a procedure to generate multiplicatively closed 
Markov models with a prescribed symmetry. This has desirable properties in terms of model 
selection. For instance, a biologist may wish that candidate models do not provide any natural 
groupings of nucleotides, and hence the 64 symmetry is appropriate, and it is then a matter 
of choosing how many free parameters are appropriate for the gi ven data set. The co mplete 
hierarchy of Lie Markov models with ©4 symmetry was derived in lSumner et al\ (|2012al ). 

In this paper, we deal with the case of closed Markov models whose symmetry is consistent 
with the grouping of nucleotides in purines and pyrimidines, i.e. AG vs. CT. As will be discussed, 
this motivates us to produce and examine the Lie Markov models with &2I&2 symmetry - the 
permutation subgroup o f 64 that pres e rves th e purine/pyrimidine grouping. At the same time, 
we will go further than Sumner et al. ( 2012al) by exploring the definition of these models and 
investigate the geometrical properties that arise naturally when we deal with the tension between 
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the algebraic formalism of Lie groups, where one works over the complex field, and the stochastic 
constraints of Markov models, where parameter values are constrained to be real and positive. In 
particular, we discuss the geometric embedding of the stochastic rate matrices within the linear 
set of complex rate matrices. These considerations motivate our definition of the stochastic cone 
of a Lie Markov model. Besides its geometrical interest, the stochastic cone is the set of stochastic 
rate matrices of the model and in a practical context is actually the main object of interest. We 
will discuss implementa tion and performance of the models we present here in a sister paper 
Woodhams et all f|2012h 



In Section [2] we recall some of the basic definitions and tools introduced in ISumner et al. 



(|2012al ). We revisit the definition of Lie Markov models, and introduce the concept of the 
stochastic cone of a Lie Markov model. We also recall the basic results on group theory and 
representation theory that are necessary for the development of our results. In Section [3] we 
recall the idea of Lie Markov model with prescribed symmetry given by a permutation group 
G. We introduce the ray orbits of the corresponding stochastic cone, which are the orbits under 
the action of G of the rays of the stochastic cone. In Section |4j we take G — ©2 I 62 and 
decompose the space of rate matrices as a ©2 I 62-module and provide a basis consistent with 
this decomposition. We also determine the isomorphic classes of possible ©2 I ©2-orbits and the 
decomposition of their (abstract) span into irreducible modules. In Section [Sj we give the whole 
list of Lie Markov models with ©2 I ©2 symmetry. Each model is given by exhibiting a basis of 
the corresponding space of matrices as well as the ray orbits of its stochastic cone. Finally, in 
the conclusions we discuss implications and possible research lines for the future. 



2 Preliminaries 



Throug hout this section, we will recall some definitions and basic facts from ISumner et al. 
(2012a)), which we also refer to for some proofs. We keep the assumptions and the notation 
already introduced there. In particular, we work over the complex field C, and for simplicity 
refer to a matrix as "Markov" if the entries in each column sum to one. Later we will discuss 
how our discussion specialises to the stochastic case where the entries must be real numbers in 
the range [0, 1]. This will lead to considering the stochastic cone of the Lie Markov model, which 
will be the set of real rate matrices with non-negative entries outside the diagonal. 

We define the general Markov model OKgmm as the set of n x n matrices whose columns sum 
to one: 

0Jt GM M ■= {M G M„(C) : 6 T M = e T ) , 

where 9 is the column n- vector with all its entries equal to 1, ie. 9 T — (1,1,..., 1). Recall 
that, in homogeneous continuous-time Markov chain, the corresponding Markov matrices occur 
as exponentials M = e^*, where Q is a "rate matrix" and t gives time elapsed. We write 
£gmm = {Q G M„(C) : 6 T Q = T }, to indicate the set of all (complex) rate matrices. We refer 
to a Markov matrix M G 5D?g mm , or a rate matrix Q G £q mm > as "stochastic" if its off-diagonal 
elements are real and positive. 

Under matrix multiplication, the set 

G£i(n,C) := {M G M„(C) : T AI = T , det(M) 7^0}), 

forms a subgroup of the general linear group of invertible n x n matrices with complex entries, 
ie. GLi(n,C < GL(n,C). It contains the matrix exponential of any rate matrix, that is, 

6 2gmm | e Q . q G £ GMM } c GLi(n, C). 
We refer to e s ' GMM as the general rate matrix model. 
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A Markov model 9Jt is some subset 9Jt C VJIqmm of the general Markov model. A Markov 
model dJl is multiplicatively closed if for all Mi,M% £ OJl we also have M\M 2 £ 9Jt. Similarly, 
given a subset £ C &gmm of rate matrices, we refer to e £ as a rate matrix model. It is clear that 
all rate matrix models are Markov models, and we simplify terminology and also and refer to £ 
as a "model" . 

We are primarily interested in rate matrix models dJl = e £ which are multiplicatively closed. 
Suppose that 9JI is "smooth" around the identity matrix 1, in that there exists differentiable 
paths A(t) £ 9Jt with ^4(0) = 1. Then we can define the tangent space at the identity: 7i(9Jt) = 
{A'(0) : A(t) £ 971, A(0) = 1}. Then, if T x (m) forms a Lie algebra, M is multiplicatively closed. 
This means that for all Qi, Q 2 £ Ti(OT) and A £ C: 

1. Q1+AQ2 GTi(SK), 

2. [Qi,Q 2 ] := Q1Q2 - Q2Q1 e Ti(«Dt). 

The first condition states that Ti(CDt) is a vector space, and the second states that Ti(9Jt) is 

closed under "Lie brackets" . 

Presently, we recall from Johnson! (1985) and Sumner et al. ( 2012a ). the Lie algebra structure 



of the general Markov model. To this aim, consider the set of "elementary" rate matrices 
{Li.j : 1 < i ^ j < n}, where Lij is the n x n matrix with 1 in the ij entry, -1 in the jj entry 
and everywhere else. The matrices {Lij}i^j form a C-basis for the tangent space of GLi(n, C) 
and, in particular, we can express any rate matrix Q as a linear sum: 

Q = 22aijLij. (1) 

This is a convenient basis for SLqmm because any rate matrix Q can be written as ([1} and the 
stochastic condition on Q is simply that the coefficients Qfy are real and non-negative. Moreover, 
the equalities 

[Lij,Lki] — (L^ — Lji)(5jk — Sji) — (Lf-j — Lij)(5u — Sji) 

exhibit the Lie algebra structure of £g mm ■ 

Given a vector subspace £ C £gmm, a stochastic generating set for £ is a generating set 
Bz = {L\,Li2, ■ ■ ■ , L4} of £ such that each is a convex linear combination of the L^-, i.e. 
Lk = otijLij where onj > 0. A stochastic basis of £ is a stochastic generating set where all 

the vectors are linearly independent. 



Definition 2.1 (cf. ISumner ~et "Zl (l2012ah l. A Lie Markov model is a Lie subalgebra £ of £gmm 



provided with a stochastic basis. 

Remark 2.2. By an elementary result in linear algebra, any generating set for a vector space can 
be reduced to a basis by removing elements, and hence Definition 12.11 would remain unchanged 
if "stochastic basis" where replaced with "stochastic generating set" . 

We are especially interested in the study of the set of stochastic rate matrices of the model. 
The condition of Definition 12.11 ensures that £ contains enough stochastic rate matrices (see 
Result [1]). It will be useful to give some geometrical interpretation of it. To this aim , we need 



to recall some basic definitions on convex polyhedral cones. Following iFultonl (|1993l ). a convex 
polyhedral cone in W 1 is defined as a set 

C = {Xivi + . . . + X r v r : Xi > 0}. 



4 



Figure 1: On the left, a strongly convex polyhedral cone of dimension 3 with 6 rays (represented 
by arrows). On the right, a convex polyhedral cone which is not strongly convex. 



generated by some finite set of vectors Vi,...,v r in R". Such vectors are called generators of the 
cone C . The reader may note that with this definition, every linear subspace of W 1 is a convex 
polyhedral cone. When a convex polyhedral cone contains no nonzero linear subspaces, it is said 
to be strongly convex. In this case, which has special interest for us, any minimal sys tem of 
generators of the cone is unique up to multiplication with positive scalars (|Fultonl . [1993). The 
rays of the cone are the positive span of each vect or in a minima l system of generators, and they 
correspond to the 1-dimensional faces of the cone (lFultonl . [l993h : see Figure Q] for an illustration. 



Farka's theorem assures that polyhedral cones can be equivalently defined as the intersection of 
half spaces: 

C = {v e K" : (v,Ui) > 0, i = 1, . . . , t} 

where u%, . . . , ut are vectors in M™ and (•, •) is the usual inner product of R n . It follows from this 
that the intersection of any two convex polyhedral cones in R™ is again a convex polyhedral cone. 
The dimension of the cone C is defined as the dimension of the linear space M.C = C + (— C) 
spanned by C; i.e. dim(C) := dim(KC). Of course, since a set of generators of a cone C is also 
a system of generators of the linear space RC, we conclude that the number of rays of a cone is 
bigger or equal than its dimension. 

Returning to our setting, we consider the real vector space £' MM of dimension 12 spanned 
by the elementary rate matrices Lij,i ^ j defined above. We denote by £q MM = {Q = 
Y]j-f.j ctijLij | ot-ij > 0}, which clearly is a convex polyhedral cone in £* MAf . Given a (complex) 
vector subspace £ in £gmm, we consider 

£ + := £ n £ GMM . 

Notice that all the entries of each matrix in £ + are real and non-negative. 

Notation 1. Consider a collection of vectors X = {X±, X 2 , . . .}. In what follows we will use the 
notation FX or (X\, X2, ■ ■ -)f to indicate the linear span of X over the field F, where F = K or 
C. That is, 

FX = (X U X 2 , • . .)f := {A1.X1 + X 2 X 2 + ... + \ q X q : A e F, q > 0}. 

Of course, FX is a vector space, and we can consider V := CX as a complex vector space, with 
some dimension r, or as a real vector space V = M.X + R(iX) with dimension 2r. To distinguish 
these dimensions, we use the notation dimc(T^) = r and dimn(V r ) = 2r. 
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Result 1. £ + = £ n £qmm * s a strongly convex polyhedral cone in £q Mm . The dimension of 
£ + as a cone is less then or equal than the complex dimension of £, and the equality holds if and 
only if £ has a stochastic generating set. 

Proof. The set £ + is the intersection of two convex polyhedral cones, so it is also a convex 
polyhedral cone. Moreover, being contained in £q A/m it is clear that it contains no linear 
subspaces, so it is strongly convex, as required. Now, to show that the dimension of £ + is less or 
equal than the complex dimension of £, consider the vector space C£ + and observe that it is a 
subspace: <C£ + C £. This implies that dimc(C£ + ) < dimc(£), and since £ + contains only real 
vectors, we have dimc(C£ + ) = dimR(R£ + ) = dim(£ + ) < dimc(£), as required. Now, assume 
that £ has a stochastic generating set B% so that B% C £ + and CB^ = £. As B% contains 
only real vectors, we have diniR(Ri?£) = dime (€!£?£) = dimc(£); and because £ + contains 
only real vectors and £+ C £, we have B 2 C £+ C RB & , so RB £ = R£+. Together this 
implies diniR(Ri?£) = diiriR(K£ + ) = dim(£ + ) = dimc(£). Conversely, suppose that dimc(£) = 
dim(£ + ). Take a generating set for R£ + composed of vectors in £ + ; by removing vectors in 
this generating set, we can always assume they actually form a basis B C £ + of R£ + . Now 
consider the vector subspace CB C £ and observe that dimc(C-B) = dimR(R£>) = dimj{(R£ + ) = 
dim(£+) = dim c (£). Thus CB = £, as required. □ 

Remark 2.3. Assume £ is a Lie algebra without a stochastic basis and take £' = C£ + as the 
complex span of the stochastic cone £ + . Since £' is a complex vector space with a stochastic 
basis, the result above shows that its complex dimension equals the dimension of £ + , which is 
strictly smaller than the dimension of £. Moreover, notice that R£ + is closed under the Lie 
bracket, since £ is a Lie algebra and the Lie bracket of matrices with real entries still has real 
entries. Since £' is generated by R£, it follows that £' is a complex Lie algebra. We conclude 
that if £ is a Lie algebra with no stochastic basis, then we can construct a strictly smaller Lie 
algebra £' with a stochastic basis and with the same stochastic cone than £. This fact justifies 
Definition 12.11 By requesting the Lie algebra to have a stochastic basis, we are considering the 
smallest Lie algebra that contains its stochastic cone. Otherwise, we would be lead to a number 
of Lie algebras giving rise to the same stochastic cone. 

Definition 2.4. The dimension of a Lie Markov model is the dimension of £ as a complex 
vector space (which by virtue of Result Q] equals the dimension of £ + as a cone). The stochastic 
cone of £ is the convex polyhedral cone £ + and the rays of the model are the rays of £ + . 

Remark 2.5. It is important to note that not every stochastic generating set of £ is a set of 
generators of the cone £ + . If this is the case and the set of generators is minimal, the positive 
linear span of each generator is a ray of the cone. 



Background on group representation theory 

In what follows we recal l basic result s from the representation theory of permutation groups 
G < & n . We recommend ISaeanl (l200lh as an excellent introduction to the required material. 



A (linear) representation of a group G is a group homomorphism p : G — > GL(V) = GL(m, C), 
where V is a C-vector space of dimension m. In this situation, p provides an action of G on V, 
and we say that V forms a G-module. A representation is said to be irreducible if it does not 
contain any proper G-submodules. 

Let G < & n be a permutation group of n elements subgroup. Write {Vi}i=i....j for the 
irreducible G-modules and pi : G — > GL(Vi) for the corresponding group homomorphism. Since G 
is finite, any representation p : G — > GL(V) is completely reducible and there is a decomposition 



G 



©4 


{4} 


{31} 


{2 2 } 


{21 2 } 


{I 4 } 


Q 


id 


sgn 


Ci 


c 2 






1 


3 


2 


3 


1 




1 


1 


1 


1 


2 


[(12)] 


1 


1 





-1 


-1 


[(12)] 


1 


-1 


-1 


1 





[(123)] 


1 





-1 





1 


[(12)(34)] 


1 


1 


1 


1 


-2 


[(12)(34)] 


1 


-1 


2 


-1 


1 


[(13)(24)] 


1 


1 


-1 


-1 





[(1234)] 


1 


-1 





1 


-1 


[(1324)] 


1 


-1 


1 


-1 






Table 1: The character tables of 6 4 and£ = {e, (12), (34), (12)(34), (13) (24), (14)(23), (1324), (1423)}. 
The rows are labelled by the conjugacy classes and the columns are labelled by the irreducible 
characters. 



of V into irreducible parts that we call isotypic components, so we can write (Maschke's theorem): 

V = ®Li<kVi, 

where the Cj are non-negative integers specifying the number of copies of the irreducible module 
Vi in the decomposition of V. 

Exa mple 1 . If w e take G = ©„, the irreducible representations are indexed by the partitions 
of n (|SagarJ . [200lh . The defining representation of & n is defined on the n-dimensional vector 
space C™ = ({ei}i<i<„) c by a : &i i-> e^uy It is a well known result that it decomposes as 
{n} ® {n — 1, 1}, where {n} is the (one-dimensional) trivial representation and {n — 1, 1} is an 
irreducible representation with dimension n — 1. 

Every irreducible representation Vi of G has a projection operator associated to it, defined 

by: 

e s ::=^£ CTeG X>)a, (2) 

where x l is the character of the irreducible representation Vi, that is, \ l maps any permutation 
of G to the trace of the homomorphism pi(G). Then, these operators project a given module V 
onto its irreducible parts, i.e. @i(V) = CiVt, so that they can be used to compute the Cj as well 
as to identify generators of the isotypic components. 

Of course, we can restrict p to any subgroup H < G according to the following diagram: 



G %■ GL(V) 




H 



making a ii-module of V. By virtue of Maschke's theorem, we can also decompose V into the 
irreducible ii-modules. Recall that the branching rule of G 4- H describes th e decomposi tion of 
the irreducible representations of G when restricted to H (see Chap.V, §18 of lWevll (|l95(j) V For 
example, in Section 6 below we will introduce the permutation group 

G = {e, (12), (34), (12)(34), (13) (24), (14)(23), (1324), (1423)} < 6 4 . 

By applying orthogonality in the character tables of 64 and g (see Table []} and concentrating 
on the conjugacy class [(12) (34)] in 64 compared to the same class in g, it is straightforward to 
derive the group branching rules shown in Table [5J 
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Background on discrete group actions 

Whenever a group G acts on a finite set B = {b\ 1 . . . , & t }, there is a group homomorphism 

A G- orbit in B is a minimal subset B = {b^ , 6j 3 , . . . , bi t } C £> which is invariant under G. That 
is 

oB := \bi , ,,,, , bi , , . . . ,bi . ,,,, } = B, for all er G G, 

l V(ct)(1) ' i p(cr)(2) ' ' i p(cr)(;) J 5 ) 

and B contains no smaller subsets with this property. From this, we can decompose B as a 
disjoint union of G-orbits: 

B = B 1 UB 2 U...liB k . 

The orbit stabilizer theorem states that up to bijective correspondence, every G-orbit has the 
form of the quotient 

G/H = {[a l ],...,[a q }}, [a i ]=a i H, 
where H is a subgroup of G and each a, e G is chosen so that ajH^aiH\ii< j fsee lBogopolski 



(2008J) for example). The operation of G induces an action in the finite set G/H by 

a : GiH i y (erer;)iJ. 

The orbit stabilizer theorem claims that there is a bijection of any orbit B with some G/H for 
some subgroup H < G. Actually, H is the stabilizer of some element x G B: G x := {g G G : 
gx = x}. As G x < G, and there are only finitely many subgroups of G, it is thus possible to give 
a complete list of G-orbits (up to isomorphism) by simply listing all quotients G/H with H < G. 

We recall that for each H < G we can turn the quotient G/H into a G-module by considering 
consider the vector space generated by the cosets of G/H 

(G/H) c = ([e], [cr 2 ], • ■ • , [o-q}) c = {v = ci[e] + c 2 [cr 2 ] + . . . + c g [cr 9 ] : a G C}, 

as a G-module via the mapping 

cr : v = ci [e] + c 2 [cr 2 ] + . . . + c q [a q ] h4- v' = Ci [cr] + c 2 [cro- 2 ] + . . . + c 9 [<rcr 9 ] . 

3 Lie Markov models with prescribed symmetry 



In Section 3 of lSumner et Ml (l2012al) . we learnt that the search for Lie Markov models is signifi- 



cantly simplified by requesting the models to have some symmetry since this reduces a potential 
infinity of models to just a number of special cases. The idea is to rely on imposing symmetry to 
assist in the search for Lie Markov models. Of course, it is expected that the larger the symmetry 
we demand, the easier the analysis will be. On the other hand, if one were to demand that the 
symmetry of Lie Markov models was the trivial group G = {e}, this would amount to taking no 
particular symmetry at all. At this point our methods break down and one is simply back at 
asking for all the subalgebras of &gmm that have a stochastic basis. 

To this aim, recall that the symmetric group ©„ has an action on Zgmm defined on the 
elementary rate matrices as 

p(a) ■ Lij := L lj{i)a(j) , 
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and extended to all of £gm m by linearity. Equivalently, the action can be defined by 



(3) 



where K a is the permutation matrix defined by 



1, if <t(6) = a 
0, otherwise. 



Definition 3.1 (cf. S umner et all (|2012af) ). We say that a Lie Markov model £ has the symmetry 
of the group G < & n if there is a basis £?£ of £ invariant under the action of G induced by ([3]), 
that is, a basis B% = {L\, ■ ■ ■ , L ( i} such that 

cr-Bfi := {K a L 1 K-\K <y L 2 K-\...,K a L d K- 1 }=Bz, Ver e G 

In this case, we will say that B& is a permutation basis of £ 

Remark 3.2. Notice that if £ has the symmetry of a permutation group G, then it also has the 
symmetry of any subgroup H < G. 



The reader is referred to ISumner et al\ (|2012al) for the statistical motivations for this defini- 
tion. Notice that in particular, we have a group homomorphism 



p:G^& d , 

where al is the dimension of the model. For all a 6 G and 1 < i < d, we have K a LiK~ Y = L 
where p(a) G &d- Thus for any rate matrix Q = Xa=i a iLi G £, we have 



(4) 



p(°0W' 



i=l 



i=l 



so G acts by permuting the model parameters, ie. cti tt p ( a -i)(i)- 



Example 2 (jSumner et all (|2012al )). The list of 4-state Lie Markov models with ©4 symmetry 
is: 



1. Jukes-Cantor model, with dimension 1 ( Jukes fc Canton . 1196 



2. Kimura model, with dimension 1 (jKimural 119811 ): 



3. Felsenstein model, with dimension 4 (jFelsensteinl . 119811) : 

4. Kimura+Felsenstein model or "K3ST+F81", with dimension 6 fsee lSumner et all (|2012bf) . 
and Example 2] below); 

5. General Markov model, with dimension 12. 

Presently, we recall t he general procedure to obtain Lie Markov models with prescribed 
symmetry, as presented in lSumner et all (|2012al ). Suppose we have a Lie Markov algebra £ with 
dimension d and a permutation group G < & n . We demand that £ satisfies the conditions 
of Definition 13.11 for the permutation group G. Then, £ is provided with a basis B% which is 
invariant under G. As explained above, we have a decomposition of B^ into G-orbits. We can 
then compare the irreducible G- modules that occur in the decomposition of £,qmm to those that 
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{4} i ^ id 
{I 4 } H. sgn 
&ilG- {31}^C + C 2 

{2 2 } h4 id + sgn 
{21 2 }^g + d 

Tabic 2: The branching rule of ©4 to 



occur in the decomposition of (G/H) c for each H < G. Finally, we can attempt to construct 
subalgebras £ C £gmm with a basis such that = B\ U B2 U . . . U i3 r is a plausible union 
of orbits Si that are consistent with the linear decomposition of £g mm induced by the action of 
G. 

The general procedure is then: 

1. Decompose the Lie algebra of the general Markov model into irreducible modules of G: 

&GMM = ©fc/fcVfc, (5) 

where k labels the irreducible G-module Vk and the fk are non-negative integers specifying 
the number of copies of each irreducible module in the decomposition. 

2. Apply the orbit stabilizer theorem and construct the list of G-orbits, G/H, by working 
through the subgroups H < G. For each subgroup H, extend the orbits linearly over C to 
the G-module (G/H) c and decompose this space into irreducible G-modules: 

(G/H) c = (B k b^V k , 

where again the b k are non- negative integers. 

3. Working up in dimension d, consider all unions of G-orbits 

S = (G/H!) U (G/H 2 ) U . . . U (G/H q ) 

such that \S\ = J2i<i< q = ^ (where | ■ | stands for cardinality). For each S, consider 

its linear decomposition into irreducible G-modules: 

(S) c = ®kdkVk 

H H H 

where ak ■= b k 1 + b k 2 + . . . + b k q , and, in order to exclude unions of G-orbits that do not 
occur in the linear decomposition of Zgmm as a G-module, check that ak < fk, for each k. 

4. For each case thus identified, consider the vector space £ = ffifeafeVjt and use explicit 
computation to check whether £ forms a Lie algebra. In this case, attempt to show that it 
has a stochastic basis. 

This procedure is guaranteed to produce all Lie Markov models with symmetry G. In Sec- 
tion [5l wc will give a complete presentation of the 4-state with ©2 I ©2 symmetry. 



Remark 3.3. Equivari ant models were first introduced in iDraisma k, Kuttlerl (12008 ) and h ave 
been widely studied in ICasanellas fc Fernandez- Sanchez! (|2010l) . In ISumner et all (|2012ah we 
modified slightly the definition to adapt it to the continuous-time Markov model setting. Under 



10 



this definition, equivariant models appear as a particular case of Lie Markov models. Actually, 
the G-equivariant model is the Lie Markov model with d ecomposition fidid id© id© ... © id, 



where we keep the notation of For example, in ISumner et al\ (|2012al ) we showed that 

the Lie Markov model with symmetry ©4 and decomposition id is just the Jukes-Cantor model 
(IJukes fc CantorL ll969h. n a similar way, we will recover the Kimura model with two parameters 



Kimuralll980(r as the Lie Markov model with symmetry G = ((12), (1324)} and decomposition 



2id (see model 2.2b in Section [5]). 
The stochastic cone of a Lie Markov model 

We want to explore the geometry of the stochastic cone associated to a Lie Markov model 
with prescribed geometry given by some permutation group G < & n . Since the action of G 
on Zgmm is as given in we infer that the space £qmm 1S invariant under this action, i.e. 
G £qmm = ^gmm- F rom this, we conclude that if £ C £gmm is a vector subspace which is 
invariant under the action of G, then the stochastic cone £ + = £ Pi £qmm * s invariant under G 
as well. 

Because each permutation in G induces a linear automorphism in &gmm and the cone £+ 
is invariant, the set of rays of the cone must also be invariant under the action of G. We infer 
that, after giving an ordering to the set rays, there is a group homomorphism 

G^& r , (6) 

where r is the number of rays of £ + . From this, we can decompose the set of rays of £ + into 
G-orbits, which we will refer to as ray orbits. Notice that in general, the above homomorphism 
is different from the homomorphism arising from a permutation basis, as described in (|4"|). 

Remark 3.4. Being a G-orbit, we can consider the abstract vector space generated by any ray 
orbit B = {Q u ...,Q r }: 

(B) C = :Oi6c|, 

where the notation [Qi\ is used to emphasise the fact that we are avoiding any reference to 
matrix addition between the elements of the ray orbit. The dimension of this vector space equals 
the number of elements in the orbit, and as a permutation representation, the decomposition 
into irreducible representations will be one of the decompositions shown in Table |H On the 
other hand, these G-orbits are composed of rate matrices, so we can also consider the vector 
subspace of £gmm spanned by these matrices. However, generators of the rays may not be 
linearly indepenedent as vectors of Zqmm and the dimension of this vector space may be smaller 
than the number of them. This vector space may not be a permutation representation and the 
decomposition of it into irreducible representations may not appear in Table [4] For an example 
of this, the reader is referred for example to ray orbits 424a, 4246, 424c presented in Table HJ 

Example 3. The number of rays of £qmm ^ s These rays are exactly the elementary rate 
matrices L^. The group homomorphism G — » & r of ([5]) corresponds to the action described in 
©• 



Example 4. In Result 17 of lSumner et al\ (|2012al) we learnt that there is only one six-dimensional 



Lie Markov model with ©4 symmetry. The Lie algebra £ is the vector space sum of the Kimura 
3ST and Felsenstein 81 models. It is generated by 

W i j=L s{m + (R i + R j ), i<j, i,j e {1,2,3,4}, 
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where R a = J2b^a Lab arid L a (y) = Lij + Lji + Lf-i + Lif. with k, I all different. The reader 
may notice that although the 6 vectors Wij do form a permutation basis of £, by taking the 
convex cone generated by them: 

{^A,.U, ; A, , ()}. 

we are not considering all the stochastic rate matrices in the model. For example, the vector 
Ri is in the stochastic cone £ + but we cannot obtain it as a positive linear combination of the 
vectors Wij . The reader may argue that this situation occurs because of our particular choice of 
a permutation basis, but the fact is this will be the case no matter the permutation basis of £ 
we consider. Actually, the stochastic cone £ + has seven rays {L a , Lp, L 1 , R\, i? 2 , R3, R4} (with 
the notation used there: L a — L s (vz)i Lp = L s ( 13 ), L 7 = L s (i2))- We will find this model again in 
Section [5] of this paper as a Lie Markov model with symmetry given by the permutation group 
((12), (1324)} < 64 (model 6.7a). 

Remark 3.5. One may wish to explore expanding the current definition of Lie Markov model 
with prescribed symmetry and accept any Lie algebra which is invariant under the action of the 
prescribed permutation group, rather than requesting the existence of a permutation basis as 
in Definition 13.11 This relaxed condition would be equivalent to requiring the existence of an 
invariant stochastic generating set for the stochastic cone £ + , not necessarily a basis for £. This 
discussion leads to an interesting topic of research that we discuss further in the conclusions 
section of the paper. 



4 Decomposition of £gmm as a 62! 62 module 

As we are especially interested in nucleotide evolution, we fix n — 4 and deal with the permutation 
group of ©4 of nucleotide permutations that preserve partitioning into purines and pyrimidincs: 
AG\CT := {{A,G},{C,T}}. By identifying nucleotides {A,C,G,T} with numbers {1,2,3,4}, 
this leads to consider the subgroup of ©4 generated by (12) and (1324): 

G := ((12), (1324)} - {e, (12), (34), (12) (34), (13)(24), (14)(23), (1324), (1423)}. 

Note that Q is isomorphic to the wreath product 62 ( 62 (see Chapter VII of iRotman 1 19951) . 



for instance) of 6 2 with itself, and also t o the dihedral group D s - the symmetries of a square. 
As was done in Sumner fc Jarvid ([2009) , this group can also be regarded as the subgroup of 



permutations of ©4 preserving the topology of the quartet tree presented by the split 12|34. 
Other identifications of nucleotides with numbers {1,2,3,4} lead to the two other copies of 
62 I ©2 that occur in ©4, preserving the topologies 13|24 and 14|23: 

©2 I 62 = {e, (13), (24), (13)(24), (12)(34), (14)(23), (1234), (1432)}, 
<* {e, (14), (23), (14)(23), (13)(24), (12)(34), (1243), (1342)}. 

Presently, we use the projection operators to decompose the Lie algebra of the general Markov 
model into the irreducible representations of Q = © 2 1 ©2- Note that the group Q has 5 conjugacy 
classes: 

N = {e} (7) 
[(12)] = {(12), (34)} 
[(12)(34)] = {(12)(34)} 
[(13)(24)] = {(13)(24),(14)(23)} 
[(1324)] = {(1324), (1423)} 
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Recall that the irreducible representations of a finite group can be put in one-to-one correspon- 
dence with its conjugacy classes, and the sum of the dimension of each irreducible representation 



squared is equal to the order of the group (see ISaeanl (|2001l ) for example) . We conclude that 



there are five irreducible representations of Q, that will be denoted by id, sgn d, (2 and £, with 
the corresponding character table is given in Table [TJ Notice that the first row in the character 
table gives the dimension of each representation. Notice also that there are four one-dimensional 
representations, namely id (the trivial or identity representation which maps each permutation 
to the identity 1), sgn (each permutation a is mapped to sgn(cr)), £1 and £2- Besides these, the 
representation £ is two-dimensional. The rows of Table [1] represent the conjugacy classes of Q. 

From now on, we will consider the restriction of the action of 64 described in (J3J) to the group 
Q. We will denote this action by pg: 

Pq{o) : Q h-> K a QK~ x . (8) 



In Result 8 of ISumner et ali (|2012al) we learnt that the decomposition of the £qmm into the 



irreducible representations of ©4 is 

&GMM = {4} © 2{31} © {2 2 } © {21 2 }. 
By applying the branching rule of ©4 to Q (see Table ^ we obtain: 

Result 2. The decomposition of the J^-state general rate matrix model &GMM into irreducible 
representations of Q is given by 

S.gmm ^2idffisgn©Ci©2C 2 ©3£, (9) 

where the decomposition of dimension is given by 12 = 2x1 + 1 + 1 + 2x1 + 3x2. 

4.1 Decomposition of the orbits of Q in Qgmm 

Following the general scheme described in Section [31 our task now is to identify the Lie Markov 
models occurring as subalgebras of Zgmm and with symmetry Q. In Table [4] we present the 
decomposition of the orbits of Q. These are computed by using the orbit stabilizer theorem and 
projecting (Q/H) c onto the irreducible module Vj, of Q using the projection operator 0^. 

Example 5. Here we develop the case of H = ((12)(34)) as an illustrative example. We have 

g/7J = {[e],[(12)],[(13)(24)],[(1324)]}, 

where [a] represents the coset in Q/H containing the element a. Namely, 

[e] = {e,(12)(34)}, 

[(12)] = {(12), (34)}, 

[(13)(24)] = {(13)(24),(14)(23)}, 

[(1324)] = {(1324), (1423)}. 

These cosets inherit an action of Q by taking a : [a 1 ] n> [erer'] , which can be extended linearly to 
a linear representation of Q by taking the module 

(Q/H) c = <[e], [(12)], [(13)(24)], [(1324)]) c £ C 4 . 
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Next, we decompose (G / H) c into irreducible modules of Q by applying the projection operators: 
@id, @sgn, @Ci' ®C2; ©£■ F° r example: 

©idN = | E CTee *" • [e] = i ([e] + [(12)] + [(13)(24)] + [(1324)]) . 

As this projection is non-zero, we conclude that (Q/H) c contains the trivial representation id. 
We can check that the image by Q id of the other coset elements gives the same projection, so 
(Q/H) c contains id only once. Similarly, referring to the character table of ©4 (see Table [lj, we 
have 

©sgn[e] = I E. ee X sg » • M = i ([e] - [(12)] + [(13)(24)] - [(1324)]) 

and we check that 6 sgn [e] = 6 sgn [(12)] = sgn [(13)(24)] = 8 sgn [(1324)] to learn that (G/H) c 
does contain a copy of the sgn representation. Similarly, we check that (Q/H)c contains a copy 
of £1 and C2 representations. On the other hand, we see that 

6 C [e] = |E. e aX«(O-[e] = HN-[(12)(34)]) = 

and we check that 6^(12)] = 6 S [(13)(24)] = 9 £ [(1324)] = to learn that (G/H) c does not 
contain a copy of the £ representation. Putting this together and counting dimensions, we infer 
that 

(G/H) c ^ id © sgn © Ci © (2- 

Proceeding as in this example, we have produced the results summarised in Table HI It gives 
the decomposition of (G/H) c into irreducible representations for each subgroup H < G- The 
second column shows how many copies of each H occur as a subgroup in G, with non-isomorphic 
copies accounted for with distinct decomposition in the fourth column. For example, there are 
three "types" of 6 2 in G- 

{e,(12)} = {e,(34)} = Z 2 ; 
{e,(12)(34)} = Z 2 ; 
{e,(13)(24)}-{e,(14)(23)} S Z 2 . 

These three types are structurally different as subgroups of G and as a result, the correspond- 
ing spaces (G/H) c have different decomposition into irreducible modules, as shown in Table |U 
Similarly, there are two "types" of 62 x 62 : 

{e,(12),(34),(12)(34)} = 6 2 x 6 2 , 
{e,(12)(34) > (13)(24),(14)(23)} = 6 2 x 6 2 . 

Again, these two types have differing decomposition into irreducible subspaces. 

Finally, Table[3]shows all possible decompositions for a ^-invariant subspace of &GMM allowed 
by the decomposition of £gmm 01 Result [2l The list is obtained by adding decompositions of G- 
orbits (see Table 2]) as long as they are allowed by the decomposition of Zgmm as a (/-module (see 
Result [2]) • Note that the decomposition © of £gmm has two copies of the trivial representation 
while the decomposition of each G/H has only one copy. Referring to Table |21 we conclude the 
following 

Result 3. In the 4-state case, there are no Lie Markov models with © 2 ; © 2 symmetry with 
dimension seven or eleven. 
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Dimension 


Decomposition 


1 


id 


2 


2id 




id © sgn 




id e Ci 




id © C,2 


3 


2id © sgn 




2id © Cl 




2id © C 2 


4 


id © C2 © £ 




id © sgn © Cl © C2 




id © sgn © £- 




2 id © 2( 2 




2id © sgn © (2 




2id © Ci © C2 



Dimension 


Decomposition 


5 


2id © sgn © C 
2id © sgn © Cl © C2 
2id © C2 © £ 


6 


2id © sgn © Cl © Z 
2id © sgn © Ci © 2C2 
2id © sgn © C2 © 5 
2id © Ci © C2 © xi 
2id © 2C2 © ? 


8 


id © sgn © Ci © C2 © 2C 

2id © 2C2 © 2£ 
2id © sgn © Cl © 2C2 © Z 
2id © sgn © C2 © 2£ 


9 


2id © sgn © Cl © C2 © 25 


10 


2id © sgn © Cl © 2C2 © 2£ 


12 


2id © sgn © Ci © 2C2 © 3£ 



Table 3: All possible decompositions into irreducible modules of the (/-invariant subspaces of 
£gmm — 2id © sgn © Ci © 2C2 © 3£. The list is obtained by adding decompositions of (/-orbits 
(see Table S]) as long as they are allowed by the decomposition of &GMM as a ^-module (see 
Result [5J . Those decompositions with only one copy of the identity representation correspond 
to the irreducible permutation representations listed in Table 21 



4.2 A convenient basis 

We introduce an explicit basis for each module present in the decomposition of Result [5J The 
quotients Q/H can be nicely understood as the ^-orbits of some graphs, so that the corresponding 
permutation representations spanned by Q/H are isomorphic to the linear span of these graphs. 
For example, the representation id is isomorphic to the span of the quartet tree 12(34, as this 
graph remains invariant under the action of Q. Similarly, id+ sgn + £ is isomorphic to the linear 
span of the cherries: ({1, 3}, {2, 4}, {1, 4}, {2, 3}} c . This interpretation is useful and assists to 
obtain a convenient basis for the whole space £gmm- 



Permutation vectors. For each a € G,cr ^ e, Permutation vectors are defined as 

L„ = -l + K n 



For example, we have 



(12) 



/ * 1 \ 
1*00 

0*0 

\ * ) 



^2 L 3°U)' 
1<J<4 



L 



(1423) 



/ * 1 \ 
0*01 

1*0 

\ 1 * j 



where we use * to indicate the diagonal entry needed for the column to sum to zero. Notice that 
each L a is a rate matrix in £q mm ■ The linear span of these vectors has dimension 5 because of 
the linear dependencies 

£(12) + £(34) = £(12)(34), 
£(1324) + £(1423) — £(13) (24) + £(14) (23)- 



15 



Moreover, we have (Proposition 4.12 of lSumner el all (|2012al) ) 



[L a , L,ji] = [-1 + K a , -1 + K g ,\ = [K a , K a ,\ = K aa , - K a , a = L, 



aa 



The permutation vectors are useful because they provide simple expressions of generators of 
£gmm consistent with the decomposition of Result [5] The action pg of Q on these permutation 
vectors is given by: 

r : L a i-)- K T L a K~ x = i T (j>(a(j)) = L TcrT -i. 
i 

Notice that this action maps each matrix L a to L a i, where a' is some permutation in the 
conjugacy class of a (see ([7])). For example, it follows that £(i2)(34) is invariant under this action 
since (12)(34) is the only permutation in its conjugacy class, while L(i3)(24) and £(i4)(23) either 
remain invariant or swap to each other. Proceeding in this way for each conjugacy class of Q 
(but for the trivial class), we identify the following (/-modules and decompositions: 

(£(12),£(34)) c — id©C2, 

(£(12)(34)) C = id, 
(i(i3)(24),^(i4)(23)) c — id©sgn, 

(£(1324) > £(1423)} c = id©Cl- 

Now, by applying the projection operators to them, we find 



\- ^(14)(23)) 
-£(U)(23)» 
£(1324); 
-(34), 

where the superscript indicates which irreducible module of ©2 I 62 each vector belongs to. 

Cherry vectors. Referring to Table |4] and the permutation representation spanned by the 
"cherries" {1,2} and {3,4}, we introduce the matrices 

Chi 2 = L13 + L14 + L23 + I/ 2 4, 
Ch.34 = L31 + L32 + L41 + Li2- 

to obtain 

(Chi2,Ch 34 ) c = id©C2- 

The action of Q on each of these vectors is given by 

r : Chij rChij := L T ^ T ^ k ) + L T ^ T ^ + L T ^ T ^ + L T ^ T Qj — Ch T ^ T ^y 

Notice that Chi2 + Ch34 = B^. By applying the projection operator 8^ 2 , we see that (Chi2 — 
Ch34)c accounts for the second copy of £2- We define 

B2 2 = Ch 12 - CI134. 

Notation 2. From now on, we will use the following notation: 

B id _ B id + gid^ 

B C2 = B^+BJj 2 . 



Bi d 


- £(12)(34), 


B 2 d 


— £(13)(24) " 




= £(13)(24) 




= £(1423) — 


b[ 2 


= £(12) - L 
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£(12)(34) I 



-(34) 



C/l{l,2} 




L14 + L/2 



L3I + -^42 C7l{3,4} L 32 + £4 



Ll3 + L 2 4 



£l3 + £2. 



Wl4)(23) 



(131(24) 




£32 + La 



Ch{3,4} 



Figure 2: Geometric representation of the 5-simplex corresponding to a transversal section of 
the 6-dimensional polyhedral cone generated by L(i2), ^(34)1 -^13 + ^24, £14 + L23, L31 + L42, 
L32 + £41 ■ This representation corresponds to an orthogonal projection of this shape into the 
plane, where the external vertices represent the rays of the cone. It enlightens the relations 
among some of the matrices defined so far and clarifies their geometrical arrangement in the 
ambient space £gmm- The tetrahedral cell represented next to it is the transversal section of 
the cone generated by the vectors {L13 + L24, £14 + L23, L31 + £42, L32 + £41} • 



Row-sum and twisted vectors. Keeping the notation of lSumner et al\ (|2012al) . define the 
row-sum vectors 



Ri 

The action pg of Q on each of these is 
a : R 



^2 Li i ■ 

j:l<i=£j<A 



i-> aR l - Y,j:\<i±j<4 L <y(i)T(j) - Ra(i), 

and it is isomorphic to the restriction of the defining representation of ©4 to Q: a : e, e a uy 
Therefore, the (invariant) subspace generated by the row-sum vectors is isomorphic (as a Q- 
module) to the defining representation of ©4, given by id © {31}, restricted to the subgroup Q. 
By applying the branching rule of Table [2l we obtain 



(i?l,i?2,i?3,#4> C = id©C2©£- 



Obviously, we have Ri+R 2 + R3 + Ra = B{ d + and (R 1 + R 2 ) - (R 3 + Ra) = Bf + B^ 2 . By 
applying the projection operator 0£ we find that {R\ — R 2 , R3 — Ra)c accounts for a first copy 
of £ representation. We define 



B^ = Rt - R 2 

Next, define the twisted vectors as 

Hi = 
Vi = 



R3 — Ra 



Lik + Lu + Lj 
Lki + Lu + Li 
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Vo = 



H 3 = 



where {{i,j},{k,l}} = {{1, 2}, {3, 4}}. For example, 

/ * \ 
1*00 

1*0 

\ 1 * J 

The action pg of Q on these vectors is given by 

a : Vi H- aVi 
a : Hi i — y aH, 



* 








°\ 





* 








1 


1 


* 





V o 





1 


* / 



V 



(»)> 



so we deal again with the restriction of the defining representation of ©4 to Q . As above, we 
have that 



(Vi, V 2 , V 3 ,V 4 ) C = {H^H 2 , H 3 , H 4 ) c SS id © C2 © & 



Notice that J^i Hi 
Vi) - (V 3 + V4) = 



Bf 2 



E t V* = Bf + B| d , (ffj + H 2 ) - {H 3 + H^ = Bj 2 + B^ 2 and (Vi 



B^ 2 . By applying the projection operator 6^ in (Vi, V 2 , V3, Vi)c and 



(i?i ,H 2 ,H 3 , Hi)c, we find that {Hi — H 2 ,H 3 — -ff 4 )c and (Vi — V2, V3 — Vi}c account for the two 
other copies of £, so we define 



B^ = — Ha 



H\ — H 2 



Bl = V3 - V4. 



vi - v 2 



Putting all of these results together: 
Result 4. The Lie algebra &gmm of the J^-state general Markov model can be expressed as 

&GMM = {Lij}l<i^j<4 

= ({L a }aeg,a^e U {Chi2, Ch 3 4} U {i?i}l<i<4 U {Hj}i<j<4 U {Vfc}i<fc< 4 )c, 

with linear dependencies 

£ 



-(12) 



£(34) = £(12)(34)> 



L 



(13)(24) 



) - 


^(1324) 


+ £(1423) — 


Ch 12 - 


-Ch 34 


Hi 


+ H 2 = 


Ri + R 2 = 


Ch 12 4 


"£(12) 


H 3 


+ H 4 = 


R 3 + -R4 = 




- £(34) 






Vi + V 2 = 




-£(12) 






v 3 + v A = 


Ch 12 - 


£(34) 



^4 basis for £g mm consistent with the decomposition of Result^ is given by 

B t = Ri — R 2 , 

£(14)(23); B| = i? 3 - i? 4 , 



Bl d — £(12)(34); 



B 2 d 



(13)(24) 



B s s n = L 



(13) (24) 



(14) (23) J 



B| = i?i-i? 2 , 



Br 

B2 2 



C2 _ 



-^(1324) 
£(12) 

Ch 12 



L 



(1423); 



L 



(34); 



Ch. 



34 • 



b! 

Bt = V 1 -V 2 , 



Zf 3 — £?4 

Vi - V2, 
V3 - Vi; 



where (B^B^g, (B3,B|) C and (B|,Bg} c are i/ie three copies of £ in £,gmm- 

By explicit computation we found: 

Result 5. With respect to the basis given in Result^ the Lie algebra structure of Zqmm is 
summarised in Table [5[ 
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5 The list of Lie Markov models with 62 I ©2 symmetry 



We proceed to give the list of Lie Markov models with Q = ©2 ( 62 symmetry, working up 
in dimension d < 12. For each d, Table [3] lists all the possible decompositions allowed by 
Result [5J For each decomposition, all possible complex Lie subalgebras £ of £q m m with that 
decomposition are obtained by direct computation using some code written by t he authors and 



implemented in the open-source mathematical sofware SAGE (jStein et all 120121 ) . For each Lie 
algebra, we then impose that the real dimension of the stochastic cone £ + equals d. By Result [TJ 
this condition is equivalent to the request that the Lie subalgebra is a Lie Markov model. Notice 
that the only vectors in the special basis of Result 0] whose coordinates are all non-negative 
are the two generators of the "identity" module in the decomposition of Zqmm'- B* d and B^. 
In particular, the above condition is guaranteed if and only either £ contains one copy of the 
"identity" representation of the form aB\ d + bBlf, with a, b > or £ contains the two copies B\ d 
and B 2 d . 

For each model in the list, we describe a basis for the Lie algebra in terms of the vectors 
introduced in the Section [4] and the rays of the stochastic cone arranged in orbits (see Table [6]). 
Both of these two data is enough to describe completely the model. The general form of the 
stochastic rate matrix as well as a permutation basis (a basis invariant under the action of Q) is 
also shown when it is not too complicated. In particular, stochastic rate matrices are presented 
as linear combinations of the rays with non- negative coefficients (expressed in Greek alphabet). 
Since the rays are the generators of the stochastic cone, every stochastic rate matrix in the model 
can be expressed in this way (the reader should notice that in general, we cannot write down 
all the stochastic rate matrices of a model in terms of the same permutation basis if we request 
the coefficients to be non-negative). The name of each model has the form (d.r), where d is the 
dimension of the model and r is the number of rays of the corresponding stochastic cone (in 
particular, d < r). In case there is more than one model with a given dimension and number of 
rays, we will differentiate them by using letters: for example, 5.7a, 5.76 and so on. 

Notation 3. Throughout the following list, we adopt the following notation for i, j G {1,2,3, 4} 
and X e {R,H,V}: 

X ij = X i + X j > 

XT = Xi- Xj. 



Ray orbits. The rays of the stochastic cones of the forthcoming models appear in orbits of 
cardinality 1, 2, 4 and 8 (as is demanded by the orbit-stablizer theorem). The generators of the 
rays of these orbits are given in Tabled the vectors of the same orbit are the elements of a single 
C?-orbit in &gmm. Since Q preserves transition and transversions rates, it can be seen that they 
all have the same number of transition and transversion rates. Keeping the notation of ([T]), if 
Q = (ctij) is any element in the same 5-orbit, the values of 

sum of transition rates of Q — a\% + Ui\ + 0:34 + CK43 
sum of transversion rates of Q = a±3 + an + 0:23 + <^24 + CK31 + «32 + CK41 + 0^42 

do not depend on the particular representative Q chosen within the orbit. 

For each Lie Markov model in the following list, we will give explicitly the rays of the corre- 
sponding stochastic cone arranged in orbits. For the sake of simplicity in the notation, we will 
adopt the following convention to denote the ray orbits: each orbit will be called as rsv, where 

• r refers to the number of rays in the orbit: 1, 2, 4 or 8; 
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• s refers to the sum of the transitions rates in (any matrix of) the orbit; 

• v refers to the sum of the transversion rates in (any matrix of) the orbit. 

The reader is referred to Table [5] for the whole list of ray orbits arising in Lie Markov models 
with (/-symmetry. 



Ray orbits of the Lie Markov models with Q symmetry have been also studied in lWoodhams et al. 



(2012). The reader is referred to this paper for more information and for performance studies 
on Lie Markov models with this symmetry. 



Dimension One 

From Table 2] we see that there is only one abstract orbit of Q with cardinality one and decom- 
position id. 



id The general Markov model contains two copies of the trivial representation, (B\ d , B^c, so 
we can consider the subspace generated by any linear combination aB\ d + fcB^. Moreover, 
since 

[Bl d ,B^ = 

we see that the subspace generated by any aB\ d + bB^, a, b 6 C is a Lie algebra. When we 
request these spaces to have a stochastic basis, we have to restrict to the condition a, b > 0. 
Therefore, we conclude: 

Result 6. In the J^-state case, there is a continuum of one- dimensional Lie Markov models 
with Q symmetry and decomposition id. Each model in the family has the form 



£=(B id ) c a,6>0, 



where a + 6 = 1, a, b>0 and 



Bl 



aB id 



(* 


a 


6 


b \ 


a 


* 


6 


b 


b 


6 




a 


V b 


6 


a 


* J 



Remark 5.1. This result is not completely satisfactory as all these models will appear as 1- 
dimensional Lie subalgebras of the 2-dimensional Lie Markov model 



<B* d ,B* d ) 



2id. 



This situation is quite general and we will avoid the consequent redundancy in the present list 
by considering families of Lie Markov models as submodels of a Lie Markov models with bigger 
dimension. Then, the family of models in Result [6] will be regarded as a Lie Markov model with 
decomposition 2id. Therefore, we are not listing such cases and will refer the reader to the bigger 
Lie Markov models. 

On the other hand, notice that if we expand the symmetry and request the models in the 
family of Result E] to have the symmetry of the group o f 64, we are lead to the particular choice 
a = b, which corresponds to the Jukes-Cantor model ( Jukes fc CantqrL 1969). Of course, this 
model already appeared as a Lie Markov model with symmetry 64 in lSumner et al\ (|2012aT ). 
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1.1) Take £ = (B ld ) c . The stochastic cone has only one ray, spanned by B ld . Therefore, in 
this case, we only have a ray-orbit, which is is composed of only one orbit. We refer to it by the 
ray-orbit 148 (see Table E])- The generic stochastic rate matrix is 

/ * 1 1 1 \ 
1*11 
11*1 

y i i i * ) 



Dimension Two 



id © sgn We have [B\ d , B s « n ] = [B^ d , B s s n ] = 0, so for any couple a, b > with a + b = 1 and 



6^0, there is a well-defined Lie Markov model: 

£ = « 6 ,B^) c = id©sgn. 

Notice that the condition b ^ is needed to ensure that the dimension of the the stochastic 
cone is equal to the dimension of the Lie algebra. As claimed in Remark l5.ll these models 
are to be considered as submodels of the 3-dimensional model with decomposition 2idffisgn 
(see model 3.3a). 



id © Ci Since [B\ d , B Cl ] = [B^ d , B Cl ] = 0, we find that for any choice of a, b > with a + b = 1 



and b ^ 0, the space 



£=(^ d b ,B^) c = id©Ci 



provides a 2-dimensional Lie Markov model. These are submodels of the 3-dimension model 
with decomposition 2id + £i (see model 3.3b). 



id © (2 We find the same situation for this decomposition. The following Lie algebras will 
appear as submodels the 3-dimensional model indicated: 

• £ = (B ± a d b ,B^) c is a submodel of model 3.4; 

is a submodel of model 3.3c. As a special case of the last family of 



c 



models, when we take a = 1 and b — 0, we obtain the following model 
2.2a) £ = (B^ d ,B^ 2 ) c . The stochastic cone has 2 rays generated by £(12) and £(34), 
which form the ray-orbit 220 of Table [51 The general stochastic rate matrix of the 
model is 



/ * a \ 

a * 

* (3 

\ 0/3 * J 



a,f3>0. 



The reader may note that this model gives a reducible Markov chain, that is, it is 
not possible to get to some states from some other states. If we identify 1,2,3,4 
with A, G, C, T, we see that the purine states A and G communicate with each other, 
and the same for the pyrimidine states C and T (transitions) while no replacement 
between purines are pyrimidines (transversions) is allowed. 
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Apart from these models, our analysis of 1-dimensional Lie Markov models produces an- 
other 2-dimensional model with decomposition 2id (notice that this decomposition is allowed by 
Result 0). 



2id Of course, there is only one possible model with this decomposition: 

2.2b) £ = (Bj; d , B| rf ) c . If we focus on the stochastic rate matries, we find a cone with 2 
rays, corresponding to the following ray-orbits (see Table [6]): 

ray-orbit 140 = {i(i 2 )(34)}, ray-orbit 108 = {£(i 3 )( 2 4) + £(14)(23)}- 
The Lie algebra is abelian and the stochastic rate matrices for this model are given by 



f * a 0\ 

a * f3 (3 

/3 /3 * a 

\ (3 (3 a * ) 



a, 13 > 0. 



Notice that if we identify our ordered alphabet {1,2 .3,4} with {A , G, C, T), this model 
corresponds to the Kimura model with 2 parameters (|KimuraL flQSOlL 



Dimension Three 



2id<, 



sgn 



There is only one model with this decomposition: 
3.3a) We find that 

£=(Bi d ,B 2 d ,B s s n ) c , 

is a 3-dimensional abelian Lie Markov model. The stochastic cone has 3 rays in 2 ray-orbits: 
ray-orbit 140 = {i(i 2 )(34)}i ray-orbit 204a = {i(i 3 )(24), £(14)(23)}- 



(see Table E]). The general stochastic rate matrix is 

a,/3,7 > 0. 



/ * a /3 7 \ 

a * 7 (3 

j3 7 * a 

\ 7 (3 a * J 



Permutation basis of £: £(i 2 )(34)j £(13)(24), £(14)(23)- 

Of course, this is the Kimura model with 3 parameters (|Kimural . 1 198 II ). Note at the same 
time that this is the Z 2 x Z 2 group-based model corresponding to the copy Z 2 x Z 2 = 
<(13)(24),(14)(23)). 



2id © (i There is only one model with this decomposition: 



3.3b) We see that 



£ = <Bl d ,B 2 d ,B< 1 } { 
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is a 3-dimensional abelian Lie Markov model. The stochastic cone has 3 rays, in 2 ray- 
orbits: 

ray-orbit 140 = {£(i 2 )(34)}, ray-orbit 204b = {L(i 324 ), £(1423)}- 
The general stochastic rate matrix is 

/ * a [3 7 \ 

a . <*,/3,7>0. 

7 p * a 

\ [3 7 a * J 

This new model may be regarded as a twisted version of the Kimura model with three 
parameters. 

Permutation basis of £: £(i2)(34), £(1324): £(1423)- 

Note that this is the Z4 group-based model. Namely, it corresponds to the copy ((1324)) = 
Z 4 . 



2ic? © £2 There are two models with this decomposition: 



3.3c) Take£ = (B{ d , B", B^ 2 ) c , which is a 3-dimensional abelian Lie algebra. The stochas- 
tic cone has 3 rays, in 2 ray-orbits: 

ray-orbit 108 = {£(i 3 )(24) + £(14)(23)}, ray-orbit 220 = {L (12) , L (34) }. 

The general stochastic rate matrix is 

( * a [3 (3 \ 

* P /3 



/3 /3 * 7 
V £ /3 7 * 



a,/3, 7 > 0. 



Permutation basis of £: £(12), £(34), £(1324) + £(1423)- 

3.4) Take £ = (Bf ,B^,B« 2 ) C . The stochastic cone has 4 rays, in 3 ray-orbits: 

ray-orbit 108 = {£(i 3 )(24) + £(14)(23)}, 
ray-orbit 140 = {£(i 2 )(34)}, 
ray-orbit 224 = 

Notice this is the first example of a model with Q symmetry where the number of rays is 
bigger than the dimension of the stochastic cone. It is also the first case where the Lie 
algebra £ is not abelian: the Lie algebra structure is given by 

[£(13)(24) + £(14)(23), Rij] = Rkl - R tj> [£(12X34). R~tj] = 0, 

[£(13)(24) + £(14)(23), £(12)(34)] = [ R tj> R kl\ = R ij ~ R W 

for {ij, kl} = {12, 34}. The general stochastic rate matrix is 

/ * a + -f 13 + 7 ,3 + 7 \ 

a + 7 * P + l P + 7 

[3 + 8 [3 + 5 * a + S 

\ 13 + 6 13 + 5 a + 5 * / 

Permutation basis of £: £(i2)(34). -^12. -^34- 



a,p, 7 ,S>0. 
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Figure 3: The three figures represent transversal sections of the stochastic cone of the models 
3.4, 4.4b and 4.5a, respectively. The three of them are contained in the convex hull of the vectors 

■^(12); £(34)) Ll3 + -^24, -^14 + -^23; -^31 + -^42, ^32 + -^41- 

Dimension Four 



id( 



sgn< 



' £ Lie Markov models with this decomposition appear as special submodels of forth- 
coming models 5.7a, 5.76 and 5.7c with decomposition 2id© sgn©£, when we restrict the 
identity component of their Lie algebra £ to a subspace (B* d h ) c with a, 6 > 0. Because of 
this, we are not listing them here. The reader should take into account that depending on 
the values of a, 6 the number of rays of the cones of these models may vary. 



id 8 sgn e Ci © C2 



The models with this decomposition appear as special cases of model 5.6a 
with decomposition 2id © sgn © £1 © C2> when we restrict the identity component of £ to 
the subspace generated by some matrix B^ d b with a, b > (see the preceeding case). 



id © (2 © £ Similarly, these models are special cases of the models 5.66, 5.11a, 5.116, 5.11c and 
5.16 with decomposition 2id ffi (2 © £, when we restrict the identity component of £ to the 
subspace generated by some matrix B* d b with a, 6 > 0. 

If we request these models t o have 64 symme try, we obtain the restriction a = 6, leading 
to the Felsenstein 81 model (jFelsensteinl . 119811) : 

4.4a) Take £ = (B ld , B^ 2 , Bf , B|) c . The stochastic cone has 4 rays in one single ray-orbit: 
412a = {Ri, i?2, R3, R4}, and the Lie algebra structure is given by 

[Ri, Rj] — Ri — Rj. 

The general stochastic rate matrix is 



, a,/3,7,<5>0 



/ 


* 


a 


a 


a 




p 


* 


P 


P 




7 


7 


* 


7 


V 


6 


6 


5 


* 



Permutation basis of £: R%, R%, R3, R4. 
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2id © 2£ 2 There is only one model with this decomposition. 

4.4b) Take £ = (B\ d , B| d , B^ 2 , B^ 2 ) c . The stochastic cone has 4 rays, in 2 ray-orbits: 

ray-orbit 204c = {Chi 2 ,Ch 34 }, 
ray-orbit 220 = {L (12) , i (34) }. 

The Lie algebra is given by 

[£(12), £(34)] = 0, [£(12), Chy] = 0, 



[Chi 2 


,Ch 34 ] = 


2(Ch 34 


The general stochastic rate matrix 


is 




/ * a 


(3 (3 \ 




a * 


/3 /3 




7 7 


* <5 




\ 7 7 


5 * / 



a,^,7,5>0. 
Permutation basis of £: £(12), £(34), Chi2, Ch 34 . 



2 id © sgn © £ 2 There is only one model with this decomposition. 



4.5a) Take £ = (B^ d , B^, B sgn , B^ 2 ) c . The stochastic cone has 5 rays spanned, in 3 ray- 
orbits: 140, 204a and 224. The general stochastic rate matrix is 



/ * a + 5 (3 + 5 7 + (5 \ 

a + 5 * 7 + 5 (3 + 5 

j3 + e 7 + e * a + e 

\ 7 + £ 13 + e a + e * J 



Permutation basis of £: Rf 2 , R^ 4 , £(i3)(24), £(i4)(23)- 



a,(3,j,5,e > 0. 



2id © Ci © C2 There is only one model with this decomposition. 



4.5b) Take £ = (B{ d , B| d , B^ 1 , B? 2 ) c . The stochastic cone has 5 rays, in 3 ray-orbits: 140, 
204b and 224. Then, the general stochastic rate matrix is 



* a + 5 (3 + 5 j + 5 \ 

a + 5 * 7 + <5 (3 + 5 

7 + e (3 + e * a + e 

\ (3 + e -f + e a + e * J 



a,/3,7,a,e > 0. 



Permutation basis of £: Rf 2 , -R34, £(1324), £(1423)- 

Dimension Five 



2id © sgn © £ We obtain three Lie Markov models with this decomposition: 
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5.7a) Take £ = (B\ d , B 2 d , B sgn , Bf , B|) c . The stochastic cone has 7 rays in 3 ray-orbits: 
140 204a and 424a. The general stochastic rate matrix of the model is 



/ * a + 5 + e /3 + 5 + e 7 + S + e \ 

a + C + V * 1 + C + V /3 + C + V 

/3 + S + C j + S + C * a + S + C 

\ j + e + v /3 + e + v a + e + rj * J 



Permutation basis of £: £(i2)(34), 2-Ri + R3 + Ri + L(i3)(24), 2i? 2 + ^?3 + -R4 + ^(i3)(24), 2i? 3 + 
R\+ R2 + Z/(i4)(23), 2i?4 + R\ + i? 2 + i(14)(23)- 

5.7b) Take £ = {B\ d , B| d , B sgn , B|, B|) c . The stochastic cone has 7 rays, in 3 ray-orbits: 
140, 204a and 424b. The general stochastic rate matrix of the model is 



/ * a + C + V /3 + S + e 7 + (5 + e ^ 

a+S+s * 1 + C + V (3 + C + V 

/3 + S + C 1 + S + C * a + e + r] 

\^ + s + i] /3 + e + rj a + S + C * ) 



a,/3,j,5,e,C,V> 0- 



Permutation basis of £: £(i 2 )(34), ^Hi+H 3 +H4+L^ 13 ^ 2 4), 2.ff2+#3+-ff4+-£'(i3)(24), 2iJ 3 + 

H 1 + H 2 + L(i4)(23), 2^4 + Hi + H 2 + £(14)(23)- 

5.7c take £ = (Bj d , B| d , B sgn , B|, B^) { 



/c . The stochastic cone has 7 rays, in 3 ray-orbits: 140, 
204a and 424c. The general stochastic rate matrix of the model is 



/ * a + S + e (3 + 5 + C i + e + v \ 

a + C + V * J + S + C P + s + rj 

(3 + 5 + e j + C + V * a + S + C 

\ -y + S + e (3 + e + C a + e + n * ] 



Permutation basis of £: £(i 2 )(34), 2Vi + V3 + V4, + £(i3)(24), 2V2 + V 3 + V4 + £(i 3 )(24), 2V3 ■ 

Vl + V2 + ^(14)(23)) 214 + Vi + V2 + ^(14)(23)- 



2id © sgn © Ci © C2 We obtain a Lie Markov model with this decomposition: 



5.6a) Take £ = (B^ d ,B^ d ,B s s n ,B Cl ,Bf ) c . The stochastic cone has 6 rays, in 3 ray-orbits: 
204a, 204b and 220. The general stochastic rate matrix of the models is 



/ * ay+eS+C\ 

a * S + C 7 + £ 

7 + C S + e * (3 

\ S + e 7 + C 13 * 



a,/3,7,S,e,C > 0. 
Permutation basis of £: L(i 2 )(34), £(13)(24), £(14)(23), £(1324), £(1423)- 



2id( 



+2 



£ We obtain five Lie Markov models with this decomposition: 



5.6b) Take £ = (B| d , B| d , B^ 2 , B^, B|) c . The stochastic cone has 6 rays, in 3 ray-orbits: 
108, 140 and 412a. The general stochastic rate matrix of the model is 

/ * a + 7 /3 + 7 (3 + y \ 

a + 5 * (3 + S f3 + S 

[3 + e (3 + e * a + e 

\ f3 + C f3 + C a + C * j 



a,[3,j,5,e,C> 0. 
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Permutation basis of £: £(i2)(34)j Ri, R%, R3, Ri- 

This model can be regarded as the sum of the Felsenstein model ( Fclscnsteml 1981 ) and 
the Kimura model with 2 parameters. 

5.16) £ = (Bl d ,B 2 d ,B^,B«,B«) c . 

The stochastic cone has 16 rays, in 6 ray-orbits: 108, 140, 224, 416, 424c, 432. 
Permutation basis of £: i(i2)(34)j V\ + Chi2, V2 + Chi2, V3 + Ch34, V4 + Ch34. 

5.11a) £ = (Bj d ,B| d ,B« 2 ,B«,B«) c . 

The stochastic cone has 11 rays, in 4 ray-orbits: 108, 220, 424a, 428a. 

Permutation basis of £: £(i2)(34), 2i?i + CI134, 2i?2 + Ch34, 2i?3 + Chi2, 2i?4 + Chi2. 

5.11b) £=(Bj d ,Bi d ,B^,B«,B«) c . 

The stochastic cone has 11 rays, in 4 ray-orbits: 108, 220, 424b, 428b. 

Permutation basis of £: L(i2)(34)j 2H\ + Ch34, 2H2 + Ch34, 2H3 + Ch-12, 2iJ4 + Chi2. 

5.11c) £= (Bi d ,B| d ,Bf,B«,B«) c . 

The stochastic cone has 11 rays, in 4 ray-orbits: 108, 220, 424c, 428c. 

Permutation basis of £: L(i 3 )( 2 4) +£(14)(23)j 2Vi + Chi 2 , 2V" 2 + Chi 2 , 2V3 + CI134, 2V4 + CI134. 

Dimension Six 



2 id© sgn © £1 ffi£ We do not find any Lie algebra with this decomposition, so we conclude 



tha there are no Lie Markov models in this case. Notice that our methods do not allow to 
predict whether there actually are Lie Markov models for a given decomposition without 
explicit computation. We hope results and tools from Lie theory will assist us somehow in 
the future so we can avoid this explicit search of Lie algebras. 



2id © sgn © (1 © 2(2 We obtain only one Lie Markov model with this decomposition: 



6.6) Take £ = (Bj d , B| d , B sgn , B^ 1 , B'p, B!f) c . The stochastic cone has 6 rays, in 2 ray- 
orbits: 220 and 402c. The general stochastic rate matrix of the models is 

a,/3,7,<5,e,C > 0. 

Permutation basis of £: £(12), £(34), £13 + ^24, £14 + £23, £31 + £42, £41 + £32- 



/ * a /3 7 \ 

a * 7 /3 

8 e * e 

\ e S ( * / 



2id © sgn © £2 © £ We obtain two Lie Markov models with this decomposition. 



6.7a) The first model is £ = (B\ d , B^ d , B s s n , B^ 2 , B* , B^) c . The stochastic cone is generated 
by 7 rays, in 3 ray-orbits: 140, 204a and 412a. The general stochastic rate matrix is given 
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by 



/ * a + 5 (3 + 5 j + 5 \ 

a + e * 7 + e (3 + e 

P+C 7 + C * a+C 

\ -f + rj /3 + 7] a + T] * j 



Permutation basis of £: £(i3)(24), £(i4)(23)j Ri: R2, Ri- 



This model already appeared in ISumner et ali (|2012al ) under the name K3ST + F81 and 
it has 64 symmetry. Notice that the permutation basis just given is invariant under the 
action of Q, but not under the action of ©4. A permutation basis consi stent with the 



symme try 64 is given by the vectors Wij of Example [4] (see Result 17 of ISumner et al. 
(l2012ah ). 



6.17a) Take£ = (Bf, B| d , B 3 ^, B« 2 , B|, B|) c . 

The stochatic cone has 17 rays, in 6 orbits: 140, 204a, 224, 416, 424c, 432. 

Permutation basis of £: £(i3)(24), £(u)(23)> V\ + Chi 2 , V 2 + Chi 2 , V 3 + Ch 34 , V4 + Ch 34 . 



2id © Ci © C2 © £ We obtain three Lie Markov models with this decomposition: 



6.7b) Take £ = (Bj d , B^ d , B Cl , B C2 , Bl, B|) c . The stochatic cone has 7 rays, in 3 ray-orbits 
108, 204b and 412a. Then, the general stochastic rate matrix is 



/ * a + 5 (3 + 5 j + S \ 

a + e * 7 + e (3 + e 

7 + C /3 + C * a + C 

\ (3 + 77 7 + ?/ a + i] * ) 



, a,j3,j,S,e,C,r)>0. 



Permutation basis of £: £(1324), £(1423)1 Ri, R2, R3, R4 



6.17b) Take £ = (Bi d ,B^ d ,B^,B^,B^,B«) c . The stochatic cone has 17 rays, in 6 orbits 
140, 204b, 224, 416, 424c and 432 

Permutation basis of £: £(1324), ^(1423)) V\ + Chi 2 , V% + Chi 2 , V3 + Ch 34 , V4 + Ch 3 4. 



2id © 2(2 © £ We obtain two Lie Markov models with this decomposition. 



6.8a) Take £ = (B\ d ,B^ d ,B^,B^ 2 ,Bf ,B|) C . The stochatic cone has 8 rays, in 3 ray-orbits: 
204c, 220 and 412a. Then, the general stochastic rate matrix is 



/ * a + e 7+e 7+e\ 

a + C * 7 + C 7 + C 

5 + rj 8 + r] * (3 + X] 

\ 5 + 9 5 + 9 /3 + * ) 



, u,/3,j,5,e,(,V,8 >Q- 



Permutation basis of £: £(12)1 £(34), Ri, R2, R3, Ri- 

6.8b) Take£ = (B} d , B| d , B'p , B^ 2 , B|, B|) c< The stochatic cone has 8 rays, in 3 ray-orbits: 
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204c, 220 and 412b. Then, the general stochastic rate matrix is 

/ * a + e 7 + 77 7 + # \ 

a + £ * 7 + fy 7 + 6 

S + e S + C * fj + rj 

V S + e S + C (3 + * 



Permutation basis of £: £(12), -^(34)1 Vi + Ch.12, V2 + Ch.12, V3 + Ch34, V4 + Ch34. 



Dimension Eight 



id © sgn © (1 © £ 2 © 2£ The models with this decomposition appear in a family as special cases 



of model 9.20a), when we restrict the identity component of £ to the subspace generated 
by some vector B^ b with a,b > 0. 



2id © 2(2 © 2£ We obtain two Lie Markov models with this decomposition. 



8.8) Take £ = (Bf,B^ d ,Bf 2 ,B| 2 ,Bf,B|,Bf,B£) c . The stochastic cone has 8 rays, in 2 
ray-orbits 402a and 410. Then, the general stochastic rate matrix is 

/ * a 0\ 
7 * 5 S 
e e * £ 
\ n r\ 9 * J 

Permutation basis of £: £(12) + H\, £(12) + #2, ^(34) + -^3> ^(34) + Ha, R\, R 2 , R3, Ra- 

8.16) Take £ = (Bj d , B^ d , B^ 2 , B^ 2 , B{ , B|, B^, b|) c . The stochastic cone has 16 rays, in 
5 ray-orbits: 204c, 220, 404a.412a and 412b. Permutation basis of £: Chi 2 + Vi,Chi 2 + 

V 2 , CI134 + V3, Ch34 + V4, Ri, i?2, -R3; Ra- 



2id © sgn © Ci © 2£ 2 © £ We obtain two Lie Markov models with this decomposition: 



8.10a) Take £ = (B j d , B| d , B sgn , B^ 1 , B^ 2 , B^ 2 , B^, B|) c . The stochastic cone has 10 rays, 
in 3 ray-orbits: 220, 402c and 412a. 

8.10b) Take £ = (Bj d ,Bi d ,B s « n ,B^,B^ 2 ,B^ 2 ,B|,B«) c . The stochastic cone has 10 rays, 
in 3 ray-orbits: 220, 402c and 412b. 



2 id © sgn © £2 © 2£ We obtain two Lie Markov models with this decomposition. 



8.17) Take £ = (Bj d , B| d , B s s n , B^ 2 , B{, B|, B|, B|) c . The stochastic cone has 17 rays, in 5 
ray-orbits: 140, 402b, 412a, 422a and 432. 

Permutation basis of £: 2Vi + V 3 + V 4 + L ( i 3 )( 2 4), 2V 2 + V 3 + V 4 + £(13)(24), 2V 3 + Vi + V 2 + 

^(14)(23)> 2V4 + V1 + V2 + L(i4)(23), -Rl, i?2, -R3, -^4- 

8.18) Take £ = (B\ d , B^ d , B s s n , B? 2 , B{, B|, BI, B|) c . The stochastic cone has 18 rays, in 5 
ray-orbits: 204a, 404b, 412a, 412b, 420. 

Permutation basis of £: 2Hi + H 3 + H 4 + £(i3)(24), 2i7 2 + H 3 + Ha + L( 13 )( 24 ), 2iJ 3 + H\ + 

H 2 + i(14)(23)> 2i?4 + Hi + H 2 + L( 14 )( 23 ), -Ri, i?2, -R3, Ra- 
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Dimension Nine 



2id © sgn © d © £2 © 2£ We obtain two Lie Markov models with this decomposition. 



9.20a) Take £ = {B\ d , B| d , B sgn , , B^ 2 , B^, B^, B|, B|} c . The stochastic cone has 20 
rays, in 3 ray-orbits: 410, 204a, 204b, 404b and 804. 

9.20b) Take £ = (B{ d , B^ d , B sgn , B^ 1 , B| 2 ,B|, b|, B|, B|) c . The stochastic cone has 20 
rays, in 5 ray-orbits: 204b, 220, 402d, 422b and 812b. 

Dimension Ten 



2id © sgn © (1 © 2(2 © 2£ We obtain two Lie Markov models with this decomposition. 



10.12) Take £ = (Bj d , B| d , B sgn , B^ 1 , B^ , B^ 2 , Bf , B|, B|, b|) c . The stochastic cone has 12 
rays, in 3 ray-orbits: 402a, 402c and 410. 

10.34) Take £ = (Bf, B| d , B sgn , B^ 1 , B^ , B c 2 2 , Bf , B|, b|, b|) c . The stochastic cone has 34 
rays, in 7 ray-orbits: 220, 402b, 402c, 412a, 412b, 812a and 822. 

Dimension Twelve 



2 id © sgn © £1 © 2(2 © 3£ For completion, we include the general Markov model. 



12.12) Take the model £ = (B\ A , B| d , B sgn , B^ 1 , B^ 2 , B^ 2 , B^, B|, B|, b|, B^, B^) c . The 
stochastic cone has 12 rays, in 2 ray-orbits: 410 and 801. The general stochastic rate 
matrix is 

/ * a 

a, P, 7, S, e, C, r),9, *&, l, k, A > 0. 



77 v 

L K 



P 7 \ 
e C 

A * j 



Permutation basis of £: L 12 , L 13 , L 14 , L 2 i, L 23 , L 24 , L 31 , L 32 , L 34 , L 4 i, L 42 , L 43 . 
Remark 5.2. As already notices in Remark l5.ll a number of models in the above list have more 



symme tries than those requested by the group Q, and they already appeared in ISumner et al. 
(|2012al ) as Lie Markov models with ©4 symmetry. Namely, 



Jukes-Cantor model (1.1) 

Kimura 3ST model (3.3a) 

Felsenstein 81 model (4.4a) 

Kimura 3ST+ Felsenstein 81 (6.7a) 

General Markov model (12.12) 

For these models, the decomposition into irreducible representations of Q can be obtained from 
the decompositio n into irreducible repre sentations of 64 by applying the branching rule of Table 
[5] (cf. Table 2 of lSumner et all (|2012al )). Since there are no subgroups between Q and ©4, we 
can conclude that the rest of models listed here do not have further symmetries. 
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Remark 5.3. A vector subspace £ in &gmm is a matrix algebra if its multiplicatively closed, 
that is, if the product XY lies in £ for any couple X,Y E £,. Of course, this condition is stronger 
than the request of being a Lie algebra. The reader may wonder which of the models in the 
above list are actually algebras. It was surprising to check that the only Lie algebras which are 
not algebras correspond to the Lie Markov models that appear in families depending on some 
parameters a, b in the sense of Remark 15.11 that is, the Lie Markov models corresponding to the 
following decompositions: 

dimension 1: id; 

dimension 3: id © sgn, id © &, id © £ 2 ; 

dimension 4: id © (2 © £, id © sgn © £, id © sgn © Ci © C2; 

dimension 8: id © sgn © Ci © C2 © £• 

Notice that these decompositions correspond exactly to those decomposition of Table |3] that arise 
as irreducible permutation representations (<S/H)c for a subgroup H of Q. 



6 Discussion 



Following the ideas o f S umner et all (|2012aj ). in this paper we have discussed Lie Markov models 



with Q = ((12), (1324)) = 62 I&2 symmetry. Our main motivation is that this symmetry may be 
of special interest to the practicing biologist who wishes to deal with models preserving specific 
grouping of nucleotides into purines and pyrimidines. In Section [2] we recalled some of the basic 
definitions on Lie Markov models and the required tools arising from representation theory of 
groups that will be useful later on the paper. At the same time, we introduce a new concept 
which is the stochastic cone of a Lie Markov model, being the set of stochastic rate matrices 
of the Lie Markov model. In Section [3] we explained how to derive Lie Markov models with 
prescribed symmetry and discussed the geometry of the corresponding cone of stochastic rate 
matrices. In Section [4] we took the permutation group Q and decomposed the space of all rate 
matrices into irreducible modules of Q and provided a basis consistent with this decomposition. 
In Section [5] we gave the whole list of all Lie Markov models with Q symmetry, arranged by their 
dimension. 

In Section [5] we defined evolutionary models from a rate matrix perspective as some well- 
defined linear subspaces of the 12-dimension space £gmm of all rate matrices. We could focus 
on the substitution matrices inste ad, and keeping in mind the importance of substitution matrices 
being multiplicatively closed (see Sumner et al. ( 2012al) ). define "evolutionary model" as some 



well-defined groups 9Jt of matrices in M n (M). Then, when we restrict to the stochastic setting, 
we are lead to consider the intersection of 9Jt with the stochastic polytope: 

P sto = I M = (my) € M I my > 0, my = 1 j . 

The reader may note that P s t is a compact polytope with the identity matrix in one of the 
vertices. This polytope is cut into several connected components by the algebraic hypersurface of 
equation det(M) = 0. From a biological point of view, we are mainly interested in the connected 
component that contains the identity matrix. This is because by continuity arguments, it can be 
shown that this connected component contains the exponential of the stochastic rate matrices 
of the model. In this paper, we have preferred to introduce evolutionary models from the point 
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of view of rate matrices because both the definition of Lie Markov models and the procedure to 
construct them appear in a natural way in this setting. However, the connection between rate 
matrices and substitution models is not completely clear, and it deserves further attention. An 
interesting question on this issue is whether the image of the exponential map restricted to the 
stochastic cone covers the whole connected component of the identity. We want to explore this 
question in the future to clarify the connection between substitution and rate matrices. 

Although we hav e kept the original definition of symmetry for a Lie Markov model of 



Sumner et all (|2012af ). an interesting question arises if one tries to expand this definition. Namely, 
we could investigate Lie Markov models which are invariant under the action of some permuta- 
tion subgroup G of 64 (as suggested in Remark l3~5l) without the additional request that they 
have a permutation basis. Since the action of Q is linear, this would lead to strongly convex 
polyhedral cones. From an applied point of view, we do not find any particular reason not to 
consider this expanded definition, which would lead to a huge number of possible models. For 
example, we would admit the complex span of the ray orbits 424a, 4246 and 424c (see Table 0]) 

£ = (i2 13 , R u , R23, R24,)c 

£ = (#13 > H 14 > H 23 > H 2i)C 

£ = (v+,v+,v+,v+) c 



as models with symmetry Q and decomposition id © £. The reader may note that this decom- 
position does not appear in the list of Table |H More interestingly, it is not hard to show that 
the set of doubly stochastic rate matrices has ©4 symmetry under this expanded definition, and 
moreover forms a Lie algebra. The authors keep back this line of research for future publication. 
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H < Q 


Copies 


\Q\/\H\ 


Decomposition of (Q/U^q 


Pictorial representation 


W 


1 


8 


id ffi sgn ffi Ci ffi C2 ffi 2£ 


•l 3»- *3 *1 — >- *4 *2 >■ *3 *2 >- ^4 


e 2 = <(12)>,<(34)> 


2 


4 


id © C2 e £ 


•l «2 «3 »4 


62 = <(14)(23)), <(13)(24)> 


2 


4 


id ffl sgn ffi £ 


•l »3 »1 »4 

•2 «3 *2 »4 


62 = ((12)(34)> 


1 


4 


id ffi sgn ffi Ci ffi C2 


•l »3 »2 «3 »1 »4 «2 »4 

Yi YL )-( )-( 

/ \ A \ A \ A \ 
•2 «4 »1 «4 »2 »3 »1 »3 


Z 4 = ((1324)} 


1 


2 


idffiCi 


•l «1 
•3 »4 »4 «3 
•2 «2 


62 x & 2 = ((12), (34)) 


1 


2 


id ffi £2 


•l «2 «3 «4 


62 x 62 = ((12)(34),(13)(24)) 


1 


2 


id sgn 


•l »2 »1 »2 

\ / \ / 

/ \ / \ 
.3 .4 .4 .3 


g 


1 


1 


id 


•l »3 

\ / 
/ \ 
. 2 . 4 



Table 4: Decomposition of the orbits of Q into irreducible modules. The 5th column exhibits the orbits Q /H in terms of 2-dimcnsional 
graphs. 
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Table 5: The Lie brackets of the basis given for &gmm hi Result |4j The entries not included are easily determined by applying the rule 
[X,Y] = -[Y,X]. 



orbit 


matrices 


dec. 


as an abstract set / dec. 


in £ 




140 


L (12)(34) 




id 












£(13)(24) + £(14)(23) 




-i A 










1 A Q 


£(12)(34) + £(13)(24) + £(14)(23) 




id 










204a 


£(13)(24), £(14)(23) 




idS 


sgn 








2046 


£(1324), £(1423) 




id s 


Ci 








204c 


Ch 12 ,Ch 34 




id« 


C2 








220 


£(12), £(34) 




idS 


C2 








224 


R+ R+ 

12 ' 34 




idS 


C2 








402a 


^13 "1" ^14; ^23 H" ^24; ^31 H~ ^32 1 ^41 4" ^42 




idS 




c 






402b 


-^13 ~)~ ^42 ; ^23 ~\~ L41 , Z/31 -f- X/24 1 ^32 ~h -^14 




id 


sgn 


s 






402c 


Z/i ^ -4- Li a . Lo% -\- L\A. L^ka -\- Lao . L ^9 -1- 




idff 


sgn 


S Ct ffi C 2 






402d 


-^(13) j -^(14) 5 -^(23) 1 -^(24) 




id 6 


C2 s 








404a 


Vi + H 2 - 2L {12) , V 2 + Hi - 2L (12) , V 3 + H 4 - 2L (34) , V 4 + # 3 - 2L (34) 




idff 


C2 s 


k 1 id © i 






4046 


H+ - (L21 + L43), H+ - (L21 + L 34 ), H+ - (L12 + L43), H+ - (L12 + L34) 




id 6 


C2 s 


i 1 id e £ 






410 


£l2, ^21i ^34i £43 




idff 


C2 s 


£ / id e £ 






412a 


i?l,-R2,i?3,-R4 




idff 


C2 s 


§ / id © £ 






4126 


Vi,V 2 ,V 3 ,V 4 




id 6 


C2 s 


i / id e £ 






412c 


H 1 ,H 2 ,H :i ,H 4 




id« 


C2 s 


1 / id © £ 






416 


Vi + Chia, V 2 + Chia, V 3 + Ch 34 , Vi + Ch 34 




id« 


C2 s 


5 / id © i 






420 


£12 + £34, £12 + £43; £21 + £34, £21 + £43 




idS 


C2S 


t 1 id © i 






422a 


R+ - (L14 + L32), #f 4 - (L13 + L42), #J 3 - (£24 + £3l), #24 - ( L 23 + £41) 




idS 


C2 s 








4226 


£(1234) i £(1243) i £(1324) > £(1342) 




idS 


sgn 


©« 






424a 


#13' #14' #23' #24 
rr+ EJ+ E7t £7+ 
■"13' 11 l&i ^23' "24 




idS 


sgn 


© 5 / id © £ 






4246 




idS 


sgn 


© 5 / id © £ 






424c 


v+ 1/ + V+ v + 

K 13> v 14> v 23' K 24 




idS 


sgn 


© £ / id © (, 






428a 


2Ri + Ch 34 , 2i? 2 + Ch 34 , 2R 3 + Chi 2 , 2R 4 + Chi 2 




idS 


C2 S 


£ 






4286 


2Hi + Ch 34 , 2H 2 + Ch 34 , 2H 3 + Ch 12 , 2H 4 + Ch 12 




idS 


C2 s 


£ 






428c 


2Vi + Chi2, 2V 2 + Chi2, 2V 3 + Ch 34 , 2V 4 + Ch 34 




idS 


C2S 


£ 






432 


Vi + £(34) > V2 + L(34) , V 3 + £(12) , V4, + i(12) 




idS 


C2 s 


£ 






801 


il3, £l4, ^23, £24, L31, L32, 1/41, ^42 




idS 


sgn 


© Ci © C2 S 2£ 






804 


£(13)(24) - £l3 + £23, £(13)(24) - £24 + £l4,X(13)(24) ~ £31 + £41, £(1 3 )(24) ~ 


L 42 + 


idS 


sgn 


B Ci © C2 © 2£ / id a 


) sgn 


BCi ©£ 




£32, £(14)(23) — £l4 +£24, £(14)(23) — £23 +£l3, £(14)(23) — £32 + £ 4 2, £(14)(23) ~~ £41 +£31 












812a 


Ri — L14 + L41, Ri — L\a + L31, R 2 — L 24 + L 42 , R 2 — L 2 z + L 3 2, -R 3 — L 3 i + Li 3 , R 2 - 
L 2 3,R 4 — L 4 i + £14, #4 — £42 + £24 


-L32 + 


idS 


sgn 


© Ci © C2 © 2£/ id S 


Ci © 


C2 ©2£ 


8126 


£(123) , £(124) > £(132) ' £(134) , £(142) , £(143) , £(234) 1 £(243) 




idS 


sgn 


© Ci © C2 © 2£/ id S 


Ci © 


C2 ©2£ 


822 


£l2+£34+2Ll 3 , Li2+L 3 4+2L 3 l, L12+-L43+2L14, L12+L43+2L4I, -I<21+-I<34 + 2L23, -^21 + 


idS 


sgn 


© Ci © C2 © 2£/ id S 


Ci © 


C2 ©2£ 




L34 + 2L 32 , L 2 i + L 43 + 2L 24 , L 2 \ + L 43 + 2L 42 















Table 6: Ray orbits of the Lie Markov models with Q symmetry with the corresponding generators. The 3rd column describes the 
decomposition into irreducible representations of both the abstract set generated by these orbits and the subspace of £gmm spanned 
by them (see Remark l3.4l) . When both decompositions are equal, we write it down only once. 



